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ABSTRACT 



First Steps is a program instituted by the Western Australia 
Ministry of Education to improve the literacy and numeracy of primary school 
students, particularly those at risk of academic failure. First Steps is 
organized around reading, writing, spelling, and oral language; children's 
skills are organized along developmental continua. These continue are an 
ordered series of statements describing behaviors children display as they 
move toward literacy. The Spelling Developmental Continuum was revised as a 
response to a study of teacher assessments of the applicability of the 
continuum to their students and to teacher responses to a questionnaire about 
the continuum. The revised version was studied, and a revalidation study 
showed great improvements in the wording and acceptance of the continuum. 

This report describes, in detail, the changes made to the 1992 version of the 
Spelling Development Continuum, and records the effects of these changes as 
measured in the revalidation of the continuum. The 1993 revision seems to tap 
spelling ability better than the prior version did, and it groups indicators 
into phrases with improved precision. In addition, the item response theory 
methodology was useful in improving the Spelling Development Continuum. An 
appendix contains the indicator code numbers and text for both versions of 
the continuum. (SLD) 
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A DESCRIPTION AND AN ASSESSMENT OF THE CHANGES MADE TO THE 1992 
VERSION OF THE FIRST STEPS SPELLING DEVELOPMENT CONTINUUM AND AN 
ASSESSMENT OF THE ROLE OF ITEM RESPONSE THEORY IN THESE CHANGES 



Report to the 

Curriculum Development Branch 
Western Australian Ministry of Education 






Australian Council for Educational Research, January 1994. 



SUMMARY OF THE MAIN HNDINGS 



1. Generally, rewording indicators with a poor Infit Mean Square has led to an 
improvement in the Infit Mean Square of these indicators. 

2. The 1993 version of the First Steps Spelling Development Continuum more 
consistently taps spelling ability than the 1992 version, probably as a result of the 
changes made to the text of some indicators. 

3- The 1993 version of the First Steps Spelling Developmental Continuum groups 
indicators into phases with improved precision. There is less overlap between phases 
for both indicators and key indicators. 

4. Item Response Theory seems to have been useful in helping to improve the First 
Steps Spelling Developmental Continuum. 
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Editor’s Note 



This document is one of a series of reports that document the formative research that supported 
the creation and development of First Steps™. As a result of this research, the Education 
Department of Western Australia (EDWA), in collaboration with the Australian Council for 
Educational Research (ACER) revised First Steps in response to each of the issues and questions 
raised by this research. First Steps training courses, Developmental Continua, and Resource 
Books are published with due amendments and alterations. 

Other research documents that support the development of First Steps include: 

Dr. Phil Deschamp: 

♦ A Survey of the Implementation of the Literacy Component of the First Steps Project in WA 

♦ The Implementation of The Literacy Component of The First Steps Project in ELAN Schools 

♦ A Survey of the Effectiveness of the Focus Teacher ‘B’ Training for the First Steps Project 

♦ Student Achievement: A Study of the Effects of First Steps Teaching on Student 
Achievement 

♦ Case Studies of The Implementation of the First Steps Project in Twelve Schools 

♦ The Development and Implementation of the First Steps Project in Western Australia 



ACER: 

♦ Empirical Validation of the First Steps Reading Continuum 

♦ Empirical Validation of the First Steps Spelling and Writing Continua 

♦ Empirical Re-Validation of the First Steps Spelling Continuum 

♦ Assessment and Record of the Changes made to the Spelling Continuum 

♦ The Impact of First Steps on Schools and Teachers 

♦ The Impact of First Steps on the Reading and Writing Ability of Year 5 Students 

♦ Background: First Steps and the ACER Evaluation «& Report on the Validity of the First 
Steps Writing and Spelling Continua* 



EDWA: 

♦ Supporting Linguistic and Cultural Diversity Through First Steps: The Highgate Project 



For more information about on-going First Steps research, please contact: 

First Steps™ / Heinemann 
361 Hanover Street 
Portsmouth, NH 03801-3912 
1.800.541.2086, ext. 281 
firststeps@heienmann.com 
www.heinemann.com/firststeps 



In 1992 the Australian Council for Educational Research began an evaluation of 
aspects of the First Steps project. One part of this evaluation involved the empirical 
validation of the Spelling Developmental Continuum. Item Response Theory analyses 
were used to inform the this validation, and the subsequent revision of the Spelling 
Developmental Continuum. 

The validation found that the Spelling Developmental Continuum, in general, validly 
depicted the development of children's spelling competencies. However a number of 
minor, and two major problems were identified. The two major problems were that 
the indicators in the Transitional phase and the key indicators in the Phonetic phase 
were generally less difficult than the indicators and key indicators (respectively) in the 
developmentally earlier phases preceding them. 

In response to these and other less pressing problems that were identified, the First 
Steps project re-examined some of the indicators and proposed new wordings (and 
other changes) to try and make the meaning of particular indicators clearer and to give 
greater coherence to the Continuum as a whole. ACER was then asked to re-validate 
this newly worded Spelling Continuum. This re-validation was conducted in 
November 1993. The main findings from this re-validation can be found in the report 
Empirical re-validation of the First Steps Spelling Continuum. The re-validation showed 
that there had been a general improvement in the Continuum as a result of the 
changes made to the earlier version. The effect of these changes was not discussed in 
great detail in the Empirical re-validation of the First Steps Spelling Continuum report 
because the main aim of the report was to assess, in general, the new version of the 
Spelling Development Continuum. 

It was felt that a more systematic record of the changes made to the earlier version of 
the Spelling Development Continuum and the outcomes for each indicator that had 
been changed, was needed. Such a record would be of use to First Steps. It may also 
assist in evaluating Item Response Theory as a method for informing the revision of 
Developmental Continua such as those produced by First Steps. 

Throughout this report the earlier version of the First Steps Spelling Developmental 
Continuum is named the 1992 version. The Spelling Developmental Continuum with 
the changed indicators is known as the 1993 version. 

Aims of this report 

This report aims to describe, in detail, the changes made to the 1992 version of the 
Spelling Development Continuum, and to record the effect of these changes as 
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measured in the re-validation of the Continuum. It also aims to provide an assessment 
of these changes. 

The research questions 



The following specific questions are addressed in this report: 

• Which First Steps indicators with a poor Infit Mean Square^ in the 1992 version 
improved in the 1993 version of the Spelling Development Continuum? 

• Which First Steps indicators with a good Infit Mean Square in the 1992 version 
now have da poor value in the 1993 version of the Spelling Development 
Continuum? 

Which First Steps indicators with an extreme estimate of difficulty (compared with 
other indicators within the phase) in the 1992 version, improved in the 1993 
version of the Spelling Development Continuum?2 

• Which First Steps indicators with an appropriate estimate of difficulty in the 1992 
version are now outliers in the 1993 version of the Spelling Development 
Continuum? 

What patterns are there in the changes to the wording of indicators which may 
suggest an explanation for the changes observed in the 1993 version of the 
Spelling Development Continuum? 

• Overall, from the perspective of Item Response Theory, to what extent has the 

Spelling Developmental Continuum been improved upon with the rewording of 
indicators? 



The Sample 

The data for the 1992 validation of the Spelling Developmental Continuum were 
coUected from 39 teachers assessing 353 students. These teachers taught either Year 1, 
Year 3, Year 5 or Year 7 students. They were selected from the population of Western 
Australian government Primary and District High schools which were known to have 




^ A poor Infit Mean Square usually means that an indicator is ambiguously worded or not 
tapping spelling ability as well as it might. This is explained in more detail below. 

An indicator with an extreme difficulty (compared with other indicators within the phase) 

makes it difficult for teachers to use the Continuum. Again, this is discussed below in more 
detail. 
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been formally involved with implementing First Steps. For the 1993 re-validation of 
the Spelling Developmental Continuum 88 teachers assessed 843 students. These 
students came from Kindergarten, Years 1, 3, 5 or 7 from a sample of Western 
Australian government primary and District High schools. 

For more details of the sampling see the Empirical validation of the First Steps Spelling 
and Writing Continua and the Empirical Re-validation of the First Steps Spelling Continuum. 



The Data 

The data for the validations consisted of judgements made by teachers about the extent 
to which each of up to ten students in their class exhibited evidence of having 
demonstrated the behaviour described by each of a number of First Steps indicators. 

In this report, the data consist of the results of the validations. These data are found in 
the appends. (The appendix appears both as a separate document and as an 
attachment at the end of this report. This is done to facilitate access to the detail of the 
data while guarding against the possibility that a separate appendix may become lost.) 



Item Response Theory (Rasch Modelling) and the validation of the Spelling 
Developmental Continuum 

The objective of the empirical validation and the re-validation of the Spelling 
Developmental Continuum was to collect information about the typical order of 
behaviours exhibited by children as they learn to spell. This information was then 
compared with the order of behaviours proposed in the two versions of the Spelling 
Developmental Continuum. 

The data for the empirical validations of the Spelling Development Continuum were 
analysed using the computer program '(^uest' (Adams and Khoo, 1992) which 
produces Item Response Theory calibrations of indicators. 

The calibration process (based on the Rasch model) estimates a 'difficulty' level for 
each indicator. In general, the greater the number of students exhibiting the behaviour 
described by an indicator, the 'easier' (lower on the scale of difficulty) that indicator is 
estimated to be. In this way, the calibration process parallels the intention of First Steps 
by seeking to locate indicators at positions along a Continuum. 
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The calibration takes observational data supplied by teachers. These data consist of 
judgements made by teachers about whether their students do, or do not, exhibit the 
various behaviours described by the Spelling Developmental Continuum indicators. 
The calibration provides an estimate of difficulty for each indicator. 

Once having estimated the difficulty of the Spelling Developmental Continuum 
indicators, the validation of the Continuum could begin. The validation involved 
establishing how well the indicators, ordered by the calibration process matched the 
order of indicators proposed by First Steps. This task is referred to as the empirical 
testing of the validity of the Continuum. 

Exhibit l3 shows the calibrations for the indicators in the 1992 version of the Spelling 
Development Continuum. There were some problems for the Continuum revealed by 
these data. 

The 1992 version of the Spelling Continuum had a serious overlap between the 
Preliminary and Semi Phonetic Phases, a less serious but noticeable overlap between 
the Semi Phonetic and Phonetic Phases as well as a significant overlap between the 
Phonetic and Transitional Phases. The developmentally later Transitional Phase had, 
on average, indicators which were less difficult than the Phonetic Phase. 

In the 1993 version of the Spelling Development Continuum, the Transitional Phase 
still has, on average, easier indicators than the Phonetic Phase, however, of the 
overlaps between phases, only the one between the Phonetic and Transitional Phases 
remains. This can be seen in Exhibit 2. 

It was concluded in the Empirical Re-validation of the Spelling Development Continuum 
that, because of the improved relationships between the phases, the changes made by 
First Steps to the Spelling Development Continuum had led to an improvement. That 
is, phases were more clearly delineated and so teachers would be able to locate 
children more precisely and consistently using the revised version. 




^Exhibit 1, and each of the following Exhibits, shows the data using the median to represent 
the measure of central tendency and the interquartile range (the central 50% of the data around 
the median) to represent the spread. These measures are more resistant to the effect of outliers. 
The data in Exhibit 1 are displayed using a type of box plot. The lower boundary of the box 
identifies the value above which 75% of the estimates fall and the upper boundary of the box 
identifies the boundary below which 75% of the estimates fall. Thus, 50% of the estimates are 
located between the top and the bottom of the box. The horizontal line inside the box marks the 
location of the median. In some boxes the median is not located centrally. In these cases the 
data are concentrated on the side of the box which is closer to the median. The (horizontal) 
width of a box is a function of-the number of indicators in the phase. The more indicators there 
are in a phase, the wider is thei box. 
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Exhibit 1: Box Plots of difficulty estimates of indicators within phases of the 1992 
version of the First Steps Spelling Developmental Continuum 



4 - 
3 - 
2 - 

1 - 

3 
S 

Q 

-1 “ 

-2 - 
-3 - 

Preliminary Semi-Phonetic Phonetic Transitional Independent 




Exhibit 2: Box plots of difficulty estimates of indicators within phases of the 1993 
version of the First Steps Spelling Developmental Continuum 
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In First Steps, the location of a child within a phase is not established by the use of all 
indicators. There are certain indicators which have been defined as 'key' and it is these 
'key' indicators which are used to allocate a child to a phase of development. 
Accordingly, it is how these key indicators are spread within phases and how much 
overlap there is between phases for these key indicators which is critical for 
establishing how reliably children will be placed into a phase. Exhibit 3 shows the 
estimated difficulty of key indicators within phases for the 1992 version of the Spelling 
Developmental Continuum. Exhibit 4 shows the same for the 1993 version. An 
examination of these exhibits shows considerable improvement in the 1993 version. 

For the 1993 version of the Spelling Continuum there is far less overlap between 
phases. For example, the overlap between the Semi Phonetic and Phonetic Phases in 
the 1992 version (seen in Exhibit 3) is close to ideal in the 1993 version (Exhibit 4). 

There is also a more even spread of difficulty along the difficulty scale. This is 
particularly noticeable with the Semi Phonetic Phase. 



Exhibit 3. Box plots overlaid with data points of the difficulty estimates of key 
j^^^^^tors^ithin phases of the 1992 version of the First Steps Spelling Developmental 
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Exhibit 4: Box plots overlaid with data points of the difficulty estimates of key 
indicators within phases of the 1993 version of the Spelling Developmental 
Continuum. 
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There remains in the 1993 version, however, a considerable overlap between the key 
indicators of the Phonetic and the Transitional Phases. This means that teachers using 
these the key indicators from these two phases will not be able to properly identify 
children's level of development. 

Item Response Theory also provides estimates of the extent to which an indicator is, in 
the case of the Spelling Development Continuum, tapping spelling ability and how 
consistently teachers are able to interpret an indicator. The statistic describing this 
estimate is the 'In fit Mean Square'. 

A metaphor may help to explain this term. A First Steps Continuum can be likened to 
a path along which indicators are placed to act like milestones or sign posts. These 
milestones tell the teacher where the child is in his or her development. The Infit Mean 
Square might be likened to establishing how close these milestones are placed to this 
path. If the milestones are close to the path then their message is clear and 
unambiguous about a child's location. The further from the path the milestones lie, the 
more indistinct and ambiguous becomes their message. If they lay a long way from the 
path it may not be clear that they refer to this path at all but to another. 

A poor Infit Mean Square can also indicate that teachers cannot consistently interpret n 
indicator. Often the wording is found to be ambiguous. 

13 BEST COPY AVAILA 
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Assessment of changes to the 1992 version of the Spelling Developmental 
Continuum 

The assessment of the changes to the 1992 version of the Spelling Developmental 
Continuum begins by examining the differences in the number of indicators with a 
poor Infit Mean Square. It then examines differences in the number of indicators with 
>. an extreme estimate of difficulty compared with other indicators within the phase in 
which it is located. 



O 

ERIC 



Infit Mean Square 

The Infit Mean Square is important because it provides a measure of the coherence of 
the continuum. The fewer indicators with a poor Infit Mean Square, the more 

consistently is the continuum tapping, in the case of the Spelling Continuum, spelling 
ability. 



In the 1992 Spelling Development Continuum 15 indicators were identified with a 
poor Infit Mean Square. Of these 15 indicators, 2 were omitted from the 1993 version, 
10 had changes made to their text and 3 were left unchanged.^ 



The 15 indicators with a poor Infit Mean Square statistic in the 1992 version are: 



Preliminary Phase: 
Semi Phonetic Phase 
Phonetic Phase 
Transitional Phase 
Independent Phase 



102,103,106,444, 116, 118 
307,216 

302, 303, 304, 307, 311 

403, 406 

nil 



(where st rike through means the indicator was omitted from the 1993 version, and 
bold means that the indicator was reworded for the 1993 version). 



The 7 indicators with a poor Infit Mean Square statistic in the 1993 version are: 
• Preliminary Phase: 1 02, 1 03, 1 1 6, 

nil 

307. 308.320 
408 . 
nil 



Semi Phonetic Phase 
Phonetic Phase 
Transitional Phase 
Independent Phase 



(where underlined means a poor Infit Mean Square appears only in the 1993 version). 



The code numbers which follow are the numbers used for the 1992 version of the Spelling 
Development Continuum. Use the appendix to obtain the text of these indicators. Also use the 
app>endix to link the 1992 code numbers to the 1993 code numbers. Only the 1992 code 
numbers are used in the body of the report to avoid confusion. 
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The following 9 indicators have an improved Infit Mean Square in the 1993 version of 
the Spelling Development Continuum - 106, 118, 216, 302, 303, 304, 311, 403 and 406. 

Of the 10 indicators from the 1992 version which had their text changed, for the 1993 
version, 8 now have a satisfactory Infit Mean Square. Of these 10, 5 had changes made 
only to the examples attached to the stem of the indicator (216, 304, 307, 31 1 and 406). 
This led to an improvement for all but 307. Two indicators had changes made to the 
stem only (116, 118) which led to an improvement for 118 but not for indicator 116. 
Three indicators had changes made to both the stem and the examples (302, 303 and 
403) and all three indicators now have a satisfactory Infit Mean Square. 

The following 3 indicators have a poor Infit Mean Square in the 1993 version but did 
not have a poor Infit Mean Square in the 1992 version - 308, 320 and 408. One of these 
indicators (308) had text changed. Perhaps this change might be reviewed. 

Two of the 1992 indicators with a poor Infit Mean Square which were unchanged 
remain with a poor Infit Mean Square. These are indicators 102 and 103. These 
indicators would seem to require review, as they have now been shown, with two 
different sets of data collected at two points in time, to be not tapping spelling ability 
to a satisfactory extent. This is strong evidence that they are ambiguously worded and 
so teachers are unable to consistently interpret them. They probably require 
rewording. 

Two indicators which had their text changed remain with a poor Infit Mean Square. 
These are indicators 116 and 307. (Indicator 116 had the stem changed and 307 had 
examples changed.) Changing the wording has not improved these indicators. This 
suggests that these indicators may not tapping spelling ability because they are 
misconceived. It may mean that the wording of these indicators needs to be further 
changed as the changes for the 1993 version were not very extensive. Alternatively, 
consideration should be given to omitting these indicators from the Continuum. 

In general, rewording indicators with a poor Infit Mean Square has led to an 
improvement in the cohesion of the Spelling Development Continuum. There are 
fewer indicators with a poor Infit Mean Square in the 1993 version. (There were 15 in 
the 1992 version and 7 in the 1993 version.) 
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Estimates of Difficulty 

Before discussing the estimates of difficulty it is important to stress, that overall, the 
changes made to the Spelling Continuum have led to a marked improvement in it (as 
judged by an Item Response Theory perspective). There are, however, a similar 
number of extreme estimates of difficulty in both versions. This might suggest that 
there has not been much improvement. An examination of Exhibits 1 and 2, and 
Exhibits 3 and 4, shows that there has been an improvement. This apparent anomaly 
occurs because outliers are defined relative to the location of other indicators within 
phases. 



For the purposes of this report an indicator was judged to have an extreme estimate of 
difficulty if it fell on or beyond the lOth or 90th percentile of each Phase except for the 
Preliminary and the Independent Phases. In other words, an indicator has an extreme 
estimate of difficulty if it is too easy or too hard compares to other indicators for that 
phase.) For the Preliminary Phase only indicators with an estimated difficulty on or 
above the 90th percentile were defined as extreme. (That is, only indicators with 
relatively difficult estimates were defined as extreme in the Preliminary Phase.) For 
the Independent Phase, only those indicators with an estimated difficulty below the 
10th percentile (that is, relatively 'easy') were defined as extreme. It was felt that no 
Preliminary Phase indicator could be regarded as too easy nor any Independent Phase 
indicator too difficult because these Phases are at the extremes of the Spelling 
Developmental Continuum. (This was providing these phases did not have a large 
gap between them and their adjacent phases. There were no such gaps in the 1993 
version of the Spelling Developmental Continuum.) 

Typically, indicators with an extreme estimate of difficulty lie within, or occassionally 

beyond, the range of estimates of difficulty of the indicators within an adjoining phase 
or phases. 




There were 18 indicators with an extreme estimate of difficulty in the 1992 version of 
the Spelling Developmental Continuum. 



The 18 indicators with an extreme estimate of difficulty in the 1992 version are: 

• Preliminary Phase; 109,444,120 

• Semi Phonetic Phase 201,205,307,212, 



• Phonetic Phase 304, 307, 312, 322 

• Transitional Phase 401,403,413,415 

• Independent Phase 512,513,514, 

(where, again, stnkc through means the indicator was omitted from the 1993 version, 
and bold means that the indicator was reworded for the 1993 version ) 

16 
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The 16 indicators with an extreme estimate of 



• Preliminary Phase; 

• Semi Phonetic Phase 

• Phonetic Phase 

• Transitional Phase 

• Independent Phase 



101 . 120, 124. 
210. 216. 219 
304, 312, 312, 322 
401.409.411.415 
512, 514 



difficulty in the 1993 version are: 



(where under lined means an extreme difficulty estimate appears only in the 1993 
version.) 



The following 8 indicators, which had an extreme estimate of difficulty in the 1992 
version, show an improvement in the 1993 version - 109, 201, 205, 212, 307, 403, 413 
and 513. 

The following indicators did not have extreme estimates of difficulty in the 1992 
version, but do have in the 1993 version; 101, 124, 210, 216, 219, 317, 409 and 411. Of 
these indicators, 216, 317, 409 and 411 had their text changed for the 1993 version. The 
revised text may need to be re-examined for these indicators. 

Of the 18 indicators identified in the 1992 version of the Continuum 2, were left out of 
the 1993 version, 8 had their text changed and 8 were left unchanged. Of those 
indicators which had their text changed, 4 indicators show less extreme estimates of 
difficulty in the 1993 version. Of the 8 of indicators which were left unchanged, 4 
assumed less extreme values in the 1993 version. 

Of particular interest are those indicators which have extreme estimates of difficulty in 
both versions of the Spelling Continuum. These indicators should be carefully 
examined. The indicators 120, 322, 415 and 512, which are extreme in both versions of 
the Continuum, were not reworded. It is suggested that the wording of these 
indicators be examined. The indicators 304, 312, 401 and 514 have extreme estimates 
of difficulty despite rewording for the 1993 version. It may be that these indicators 
need to be moved to other phases or, perhaps, omitted from the Continuum. 
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In general, the changes to the 1992 version of the Spelling Developmental Continuum 
have not lead to much of a reduction in the number of indicators with an extreme 
estimate of difficulty. (The number has gone from 18 to 16.) However, there has been 
an overall improvement in the characteristics of the Spelling Developmental 
Continuum. This can be seen when comparing Exhibits 1 and 2 and, for the key 
indicators. Exhibits 3 and 4. This suggests that the rewording of the indicators has led 
to a better clustering of indicators within phases. It should be noted that this 
rewording has not removed the problem of the overlap between the Transitional and 
Phonetic Phases. 

17 
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A summary of the indicators which may need further revision in the 1993 version of 
the Spelling Developmental Continuum 

The following indicators should be revised or considered for exclusion because they 
have a poor Infit Mean Square in both the 1992 and 1993 versions of the SpelUng 
Developmental Continuum: 

102,103,116,307 

Indicators 102 and 103 had no changes made for the 1993 version, and so rewording 
may improve them. Indicators 116 and 307 remain a problem despite rewording. 

They may be candidates for exclusion from the continuum. As a minimum they 
should be reviewed for consistency of content with other indicators for that same 
phase, and checked for possible ambiguity. 

The following indicators have a poor Infit Mean Square in the 1993 version and if this 
version is to be adopted then these indicators may need to be further examined; 

308, 320, 408 

The following indicators have extreme estimates of difficulty in both the 1992 and 1993 
versions of the Spelling Developmental Continuum; 

120, 304, 312, 322, 401, 415, 512, and 514. 

All of these indicators had their wording changed. They have remained extreme 
despite these changes which suggests that they may represent skills that are more 
consistent with another phase. They may need, therefore, to be moved to another 
phase. Alternatively, they may need to be removed from the Continuum. 

The following indicators have extreme estimates of difficulty in the 1993 version. They 
may require further consideration; 

124, 210, 216, 219, 317, 409 and 411. 
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Conclusion 



In general, the changes made to the 1992 version of the Spelling Developmental 
Continuum have led to more coherence in the Continuum. It would be expected that 
as a result teachers would be able to use the Spelling Developmental Continuum more 
consistently and more easily. As well, despite there not being much difference in the 
_ number of indicators with an extreme estimate of difficulty in the 1992 and 1993 
versions of the Spelling Developmental Continuum, changes to the indicators have led 
to an improved ordering of the indicators. This improvement is especially important 
with the key indicators. Given that many of the changes in the 1992 version of the 
Spelling Developmental Continuum were as a result of findings from the empirical 
validation, the improvements seen in the 1993 version, may be attributed, at least in 
part, to the use of Item Response theory as a tool to inform this revision. 
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Appendix: Code numbers and text for Spelling Continuum indicators: 

1992 and 1993 version, identifying poor Infit Mean Square and extreme 
estimates of difficulty 

This Appendix lists the code numbers and text for both the 1992 and the 1993 
versions of the Sjjelling Developmental Continuum. It also provides information 
'■ about the Infit Mean Square and whether an indicator has an extreme estimate of 
difficulty. The data for the 1992 version come from the Empirical validation of the 
First Steps Spelling and Writing Continua report. The data for the 1993 version come 
from the Empirical re-validation of the First Steps Spelling Continuum report. 

The body of the appendix consists of a table with 9 columns. The table is broken up 
into five sections - one for each of the Phases within the Spelling Developmental 
Continuum. 

The first column of the table lists the code number for the indicators of the 1992 
version of the Continuum. These are the same numbers as those used in the Empirical 
validation of the First Steps Spelling and Writing Continua report. The table is ordered 
by these numbers. 

The second column lists the text of each of the indicators of the 1992 version of the 
Spelling Developmental Continuum. 

The third column describes how satisfactory the Infit Mean Square was estimated to 
be in the 1992 validation study. Cells in the column are left blank where the Infit 
Mean Square was satisfactory. The word 'Poor' indicates that this indicator was 
judged to not be satisfactorily tapping spelling ability, or was ambiguous. The 
words 'Left out' show that the indicator was omitted from the analysis because 
either all students or no students exhibited the behaviour described by the indicator. 
When this happens an estimate of difficulty cannot be calculated. (No indicators 
were 'left out' of the 1993 re-validation.) 

The fourth column describes how well the estimated difficulty of an indicator 
matched the difficulty of other indicators within the phase. Where a cell in this 
column is blank, the indicator has a similar degree of estimated difficulty to other 
indicators within its phase. The other two entries in this column are Too easy' and 
Too hard'. These entries point out that these indicators have an extreme estimate of 
difficulty when compared with other indicators within the phase. 




20 



The fifth column indicates if the text of an indicator from the 1992 version was 
changed. A blank cell means there was no change. 

The sixth column lists the code numbers for the indicators in the 1993 version of the 
Spelling Developmental Continuum. These are the code numbere used in the 
Empirical re-validation of the First Steps Spelling Continuum report. Occasionally the 
word 'omitted' appears in this column. This indicates that this indicator which 
appeared in the 1992 version was omitted from the 1993 version. (Note that this is 
different from those indicators which were 'left out' of the validation in 1992 
because all or no students exhibited the behaviours.) 

The seventh and eighth columns provide the same information as the third and 
fourth columns, except that the information comes from the 1993 re- validation. This 
information can be found in the Empirical re-validation of the First Steps Spelling 
Continuum report. 

The ninth column lists the text of an indicator which was changed for the 1993 re- 
validation. Indicators which appear for the first time in the 1993 version are also 
listed in this column. A blank cell in this column means that the wording of the 
indicator was the same in the 1992 and 1993 versions. 



Finally, it should be noted that not all changes made to the 1992 version of the 
SpeUing Developmental Continuum can be attributed to the ACER validation. All 
indicators which had text changed and had either a poor Infit Mean Square or an 
extreme estimate of difficulty, were changed as a consequence of the 1992 validation 
study. Some text was changed even though the validation found no problems with 
the indicator. For more detailed information, see Appendix D in the Empirical re- 
vahdation of the First Steps Spelling Continuum report, which lists all indicators 
changed as a consequence of the validation. 



The page numbers for the tables begin with page 6 to keep the numbering consistent loith 
the separately published Appendix, 



1992 Version of the Spelling Continuum 1993 Version of the Spelling ContInuiSm 
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Appendix A: Code numbers and text for Spelling Continuum indicators; 1992 and 1993 version, identifying poor Infit Mean Square and extreme estimates of difficulty 
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